In [2]:
from IPython.display import Image, display
In [3]:
Image("images/drew_conway_venn.png", width=400)
Out[3]:
In [4]:
Image("images/bigdataborat_venn.png", width=400)
Out[4]:
Maybe...
A Data Scientist is a statistician who lives in San Francisco
...or...
Data Science is statistics on a Mac
...or...
A Data Scientist is someone who is better at statistics than any software engineer and better at software eigneering than any statistician.
A data scientist is someone who can obtain, scrub, explore, model and interpret data, blending hacking, statistics and machine learning. Data scientists not only are adept at working with data, but appreciate data itself as a first-class product
Hey wait, haven't we been doing data science of hundreds of years?
The 2013 O'Reilly Data Science Salary Survey: Tools, Trends, What Pays (and What Doesn’t) for Data Professionals.
In [5]:
Image("images/tool_usage.png", width=600)
Out[5]:
What is scientific about data science? Here is my own take:
Data science involves the application of scientific methodologies to data sets that lie outside the traditional realms of science.